During the last decade of network research focusing on structural and dynamical properties of networks, the
role of network users has been more or less underestimated in the bird's-eye view of the global perspective. In
this era of global positioning system equipped smartphones, however, users' ability to access local geometric
information and find efficient pathways on networks plays a crucial role, rather than the globally optimal
pathways. We present a simple greedy spatial navigation strategy as a probe to explore spatial networks. These
greedy navigators use directional information in every move they take, without being trapped in a dead end
based on their memory about previous routes. We suggest that centrality measures have to be modified to
incorporate the navigators' behavior, and present the intriguing effect of navigators' greediness where removing 
some edges may actually enhance the routing efficiency, which is reminiscent of Braess's paradox. In addition, 
using samples of road structures in large cities around the world, it is shown that the navigability measure we 
define reflects unique structural properties, which are not easy to predict from other topological characteristics. 
In this respect, we believe that our routing scheme significantly moves the routing problem on networks one 
step closer to reality, incorporating the inevitable incompleteness of navigators' information. 

PACS numbers: 89.40.-a, 89.75.Fb, 89.75.-k 



A sociopsychological problem that has turned out to be especially suited to methods of physics is human
mobility: how can we characterize, measure, and explain the movement of people in their daily lives [1]?
One piece of the human mobility puzzle is how to measure the navigability of cities and buildings. How
successful we are at finding our way depends on our cognitive ability, on how much relevant information
we have, and on the spatial organization of our surroundings. The methods characterizing the spatial
organization of cities and buildings can assume either that agents have complete information about the
environment of their journey or that they essentially perform random walks without any information. In
the former category, there are measures like betweenness centrality [2, 3]; in the latter, random walk
centrality [4] or first-passage time [5]. The reality, of course, is somewhere in between: we always
navigate with incomplete information [6-9]. This information could be better (if we have GPS devices or
maps) or worse (going back to the cafeteria on the second day at work in a big office building), but to
understand the large-scale patterns of human movement and how they are influenced by the spatial
organization, we need models and measures that incorporate navigation with incomplete information.

The key component of our approach is greedy navigators. These are agents in a spatial network [10] that
travel between a source s and a target t. The agents have a sense of spatial orientation and a memory of
where they have been. Briefly speaking, agents move in a direction as close as possible to the direction
of the target. If they reach a cul-de-sac, they backtrack to the previous point where they have an
untested choice of route. In this Letter, we use a discrete spatial network formalism, but it should be
rather straightforward to extend the concepts to a continuous space. We stress that we do not try to
model people's behavior exactly, but that the greedy navigators capture the relative magnitude of
deviation from an optimal navigation caused by the underlying spatial structure. In other words, we
surmise that the greedy navigators fare worse in cities or buildings where humans would easily get lost
or make unnecessary detours than in those where it is easy to get around.

There are three main categories of cognitive processes in human navigation: the use of spatial cues,
computational mechanisms, and spatial representations [11, 12]. In our greedy spatial navigation (GSN)
model, we assume the agents have a good, large-scale sense of direction, but no reliable real or
cognitive map. Note that we focus on the spatial orientation [13] rather than the geometric proximity
[7-9], since the former is more comprehensible from the point of view of a navigator inside the spatial
structure, who relies on visual cues such as landmarks [11]. In other words, it is more intuitive to
think "I am taking the road toward the north because I know that the festival takes place in the
northern part of the town" than "I am taking the road that will lead me to the point closest to the
festival." Furthermore, the geometric-proximity-based strategies heavily depend on the choice of points
(vertices or nodes in the language of graphs or networks), while the direction-based strategies are much
more robust to different choices of such points. Therefore, the latter seems to be a more reasonable
choice, considering the fact that the spatial structures are continuous in reality.

Translated to a spatial network language, where the network is represented as a graph $G$ of $N$ vertices
at coordinates $\mathbf{r}_1, \ldots, \mathbf{r}_N = (x_1, y_1), \ldots, (x_N, y_N)$ that are connected
by $M$ edges, the greedy navigators act as follows. Assume an agent stands at a vertex $i$ and wants to
travel to $t$. Let $\mathbf{v}_{ij} = \mathbf{r}_j - \mathbf{r}_i$ be the vector between vertices $i$
and $j$, and let $\theta_j$ be the angle between $\mathbf{v}_{it}$ and $\mathbf{v}_{ij}$. The greedy
navigator moves to the neighbor $j$ of $i$ that has not been visited before and has the smallest
$\theta_j$. If all the neighbors of $i$ have been visited, the navigator goes back to the vertex from
which it arrived at $i$. This is in contrast to the simple greedy navigation based on geometric
proximity, which sometimes fails to reach the target because it lacks such a backtracking process [7-9].
This procedure is repeated until $t$ is reached (which will happen eventually if $G$ is connected, or,
more generally, if $s$ and $t$ belong to the same component). We contrast the greedy navigators with
random navigators that move between the source and target in the same way as the greedy ones except that
they go to a random unvisited neighbor instead of the one with minimal $\theta_j$. Essentially, this is
a random depth-first search (DFS) [14]. See Fig. 1 for an illustration.
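
The procedure described above can be sketched in a few lines of Python. This is a minimal illustration, not the implementation used for the results reported here: the function name `greedy_route`, the dictionary inputs `pos` and `adj`, the toy layout, and the tie-breaking rule (the first listed neighbor with the smallest angle wins) are all assumptions made for the example.

```python
import math

def greedy_route(pos, adj, s, t):
    """Vertices visited by a greedy navigator from s to t (backtracking included).

    pos: dict vertex -> (x, y); adj: dict vertex -> list of neighbors.
    Hypothetical helper for illustration only.
    """
    def angle_to_target(i, j):
        # Angle in [0, pi] between the vectors i->t and i->j.
        vx, vy = pos[j][0] - pos[i][0], pos[j][1] - pos[i][1]
        tx, ty = pos[t][0] - pos[i][0], pos[t][1] - pos[i][1]
        return abs(math.atan2(vx * ty - vy * tx, vx * tx + vy * ty))

    route, visited, stack = [s], {s}, [s]
    while stack[-1] != t:
        i = stack[-1]
        fresh = [j for j in adj[i] if j not in visited]
        if fresh:
            # Greedy move: unvisited neighbor best aligned with the target.
            nxt = min(fresh, key=lambda j: angle_to_target(i, j))
            visited.add(nxt)
            stack.append(nxt)
        else:
            # Dead end: go back to the vertex we arrived from.
            stack.pop()
            if not stack:
                raise ValueError("t is not reachable from s")
        route.append(stack[-1])
    return route

# Toy layout (assumed): a cul-de-sac 0-1-2 pointing at the target 5.
pos = {0: (0, 0), 1: (1, 0), 2: (2, 0), 3: (0, 1), 4: (3, 1), 5: (3, 0)}
adj = {0: [1, 3], 1: [0, 2], 2: [1], 3: [0, 4], 4: [3, 5], 5: [4]}
print(greedy_route(pos, adj, 0, 5))  # [0, 1, 2, 1, 0, 3, 4, 5]
```

On this toy layout the navigator first enters the cul-de-sac, backtracks twice, and reaches the target in 7 moves, compared to 3 moves along the shortest path 0-3-4-5.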

How can we use greedy navigators to quantify the navigability of a map? Let $d_g$ ($d_r$) be the average
distance in the eyes of the greedy (random) navigators. More precisely, we average, over all pairs of
distinct vertices, the number of edges in the navigators' paths. $d$ is the average distance as usual
[the average number of edges for the real shortest path navigation (SPN)]. Since $d$ is a lower bound of
$d_g$ ($d_r$), the greedy navigability $\nu = d/d_g$ (random navigability $\zeta = d/d_r$) is a natural
measure of the greedy (random) navigability of the underlying spatial network. Both $\nu$ and $\zeta$
take values in the interval (0, 1], where fewer detours mean a higher value. The advantage of using the
graph distance is that $\nu$ can be interpreted, roughly, as the fraction of correct choices of which
road to take at an intersection [15]. We first measure the navigability of empirical data sets. Two of
these maps are excerpts of the road networks of Boston and New York City (NYC) [16]. The roads of the
excerpts are chosen to represent the major thoroughfares of the downtown areas. The other networks are
railway networks from Europe [17]: a data set covering most of western continental Europe, and its Swiss
subnetwork.
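
Written out explicitly, the definitions above read as follows. The per-pair path lengths $\ell_g(s \to t)$ and $\ell(s \to t)$ are shorthand introduced only for this illustration (the number of edges traversed by the greedy navigator, backtracking included, and the shortest-path length, respectively); the random navigability is obtained in the same way with $d_r$ in place of $d_g$.

```latex
d_g = \frac{1}{N(N-1)} \sum_{s \neq t} \ell_g(s \to t), \qquad
d   = \frac{1}{N(N-1)} \sum_{s \neq t} \ell(s \to t), \qquad
\nu = \frac{d}{d_g} \in (0, 1].
```

As a quick check against the Boston entries of Table I, $d = 5.72$ and $d_g = 6.82$ give $5.72/6.82 \approx 0.84$, in agreement with the tabulated 84%.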

The results for the navigability are shown in Table I. For all the cases, $\nu$ is significantly larger
than $\zeta$, which can be intuitively understood since the real road or railway structures are designed
by encoding geometric information useful to GSN. More quantitatively, the real structures show a much
better greedy navigability than a null model based on a network layout algorithm intended for
visualization [9], as we can check for the cases of the Boston and NYC roads in Table I
($\nu/\zeta = 4.509$ for the real Boston roads and 2.696 for the corresponding null model). One can also
check that $\zeta$ strongly




FIG. 1: Illustration of three different routing schemes to probe navigability-related structures of
cities and buildings. (a) shows SPN from $s$ to $t$; (b) shows the route of GSN (the angles are those
used in the algorithm); (c) illustrates a possible route of a random DFS navigation, where the
backtracking steps occur at vertices 6 and 9.

FIG. 2: An example of the GSN pathway from the entrance $e$ on the left to the central target $\tau$ for
LCM (184 vertices and 194 edges), where the red arrows indicate the pathway, the backtracking steps are
represented as blue dotted arrows, and the SPN path is shown in filled circles. The distances for this
specific source-target pair under the different routing strategies (GSN, random DFS, and SPN,
respectively) are $d_g(e \to \tau) = 87$, $d_r(e \to \tau) = 134.13$ (averaged over $10^3$ trials, with
standard error 1.067), and $d(e \to \tau) = 52$.

depends on the system size, which will be discussed later in a systematic approach based on larger data
sets. As an example of the GSN, we show its performance on an intentionally delusive layout, namely, a
maze. Figure 2 shows an example of a GSN pathway on the graph representation of the Leeds Castle Maze
(LCM) [19] in England, starting from the entrance on the left and ending at the central target.

Contrary to the conventional network centralities based either on the shortest path [2, 3] or on the
random walk [4], we define centralities based on our GSN and compare them with the conventional ones in
various cases. First, the centrality considering the "betweenness" in the pathway, called the navigator
centrality $n$ for a vertex or edge $x$, is defined as $n(x) = \sum_{i \neq j} \sigma_{ixj} / [N(N-1)]$,
where $\sigma_{ixj} = 1$ if the GSN path from the source vertex $i$ to the target vertex $j$ goes through
the vertex or edge $x$ in the middle and $\sigma_{ixj} = 0$ otherwise. We present the Boston and NYC
roads where the $n$ values are color coded,



TABLE I: Properties of five empirical data sets, indicated by the performance of routing strategies. See
the text for the definition of symbols. Null models for the Boston and NYC roads are connected samples
of Erdős-Rényi random graphs [18] with the same $N$ and $M$, where the geographic layout to guide the
GSN navigator is given by the Kamada-Kawai algorithm [9], and the results averaged over $10^3$ samples
are shown.

Network        $N$    $M$    $d_g$      $d$         $d_r$      $\nu$   $\zeta$
Boston                155    6.82       5.72        30.75      84%     19%
  Null model                 8.606(9)   3.6758(1)   23.20(1)   43%     16%
NYC            125    217    8.27       6.79        44.39      82%     15%
  Null model                 11.72(2)   4.0300(1)   33.51(2)   34%     12%
Switzerland    1613   1680   145.14     46.56       769.68     32%     6%
Europe         4853   5765   143.69     50.87       2011.93    35%     3%
LCM (Fig. 2)   184    194    62.82      20.65       86.23      33%     24%

FIG. 3: Characteristics of vertices and edges of two empirical road maps from Boston (a)-(e) and NYC (f)-(j). We show scatter plots with
color-coded density resolving the spatial structure with respect to vertices [(b) and (g)] and edges [(c), (d), (h), and (i)]. We show the relation
between betweenness $b$ and navigator centrality $n$ [(b), (c), (g), and (h)] and the relation between navigator essentiality $e$ and betweenness $b$
[(d) and (i)]. The maps [(a) and (f)] show the natural logarithm of the navigator centrality for the vertices (the lower color bars) and edges
(the upper color bars). Panels (e) and (j) chart the natural logarithm of the absolute values of navigator essentiality [the upper (lower) color bars for
normal (Braess) edges, respectively]. In (e) and (j) we also highlight some of the representative $e$ values from panels (d) and (i), respectively.



in Figs. 3(a) and (f). In all the networks, including the railway networks (not shown), we have found
that the correlation between $n$ and the conventional betweenness centrality $b$ [2] (using the shortest
path considering only topology) is larger for vertices than for edges, as shown in Figs. 3(b), (c), (g),
and (h). In other words, the relative betweenness of vertices is more or less the same whether it is
defined from the SPN pathways or from the GSN pathways, whereas for edges the two definitions give
different results.
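
A minimal sketch of how the vertex version of $n(x)$ could be evaluated is given below. It assumes that the GSN routes have already been computed and stored in a hypothetical dictionary `routes` mapping each ordered pair (source, target) to the list of visited vertices; the function name and the choice to exclude the endpoints of each route from the count are assumptions of this example.

```python
from collections import Counter

def navigator_centrality(routes, n_vertices):
    """Vertex navigator centrality from precomputed routes.

    routes: dict (s, t) -> list of vertices visited from s to t
            (hypothetical input, backtracking steps included).
    Returns a dict x -> n(x) for vertices with nonzero centrality.
    """
    counts = Counter()
    for (s, t), route in routes.items():
        # sigma is 0 or 1, so each intermediate vertex counts once per
        # route even if backtracking makes the navigator revisit it.
        for x in set(route) - {s, t}:
            counts[x] += 1
    norm = n_vertices * (n_vertices - 1)
    return {x: c / norm for x, c in counts.items()}

# Path graph 0 - 1 - 2: only the pairs (0, 2) and (2, 0) pass through 1.
routes = {
    (0, 1): [0, 1], (1, 0): [1, 0],
    (1, 2): [1, 2], (2, 1): [2, 1],
    (0, 2): [0, 1, 2], (2, 0): [2, 1, 0],
}
print(navigator_centrality(routes, 3))
```

For the three-vertex path, two of the six ordered pairs pass through the middle vertex, so its centrality is 2/6.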

To further investigate the properties of edges in terms of GSN, we introduce another centrality
addressing the essentiality $e$ of an edge $l$, defined as $e(l) = d_g[G \setminus \{l\}] - d_g[G]$,
where $G \setminus \{l\}$ refers to the graph with the edge $l$ removed from $G$. It quantifies the
average number of additional steps necessary in the absence of the edge $l$, and thus naturally measures
the edge's importance for GSN. Note that, somewhat counterintuitively, $e(l)$ can be negative, which
implies that the removal of the edge $l$ improves the navigability in terms of GSN. The case $e(l) < 0$
is clearly reminiscent of Braess's paradox [16]; i.e., road closures can sometimes reduce travel delays
caused by the discrepancy between the user-based optimum and the system-level global optimum. Our
example, therefore, illustrates an interesting phenomenon that happens even to a single navigator and
that comes only from the greedy navigation strategy. Here we denote the edges with $e > 0$ as "normal"
edges and the edges with $e < 0$ as "Braess" edges. Figures 3(e) and (j) show the Boston and NYC roads
where the $e$ values are color coded. Again, as in the case of $n$, there is a significant difference
between $e$ and $b$. Therefore, we conclude that the spatial structure of edges indeed acts as a crucial
substrate for greedy navigators.
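
The sign of $e(l)$ can be illustrated on a toy example. The sketch below is only an illustration under stated assumptions: the helper names `gsn_steps` and `essentiality`, the four-vertex layout, and the restriction of the average to an explicit list of source-target pairs (rather than all pairs) are choices made for this example, not the procedure used for the results above.

```python
import math

def gsn_steps(pos, edges, s, t):
    """Number of moves (backtracking included) of a greedy navigator from s to t."""
    adj = {v: [] for v in pos}
    for a, b in edges:
        adj[a].append(b)
        adj[b].append(a)

    def angle(i, j):
        # Angle in [0, pi] between the vectors i->t and i->j.
        vx, vy = pos[j][0] - pos[i][0], pos[j][1] - pos[i][1]
        tx, ty = pos[t][0] - pos[i][0], pos[t][1] - pos[i][1]
        return abs(math.atan2(vx * ty - vy * tx, vx * tx + vy * ty))

    stack, visited, steps = [s], {s}, 0
    while stack[-1] != t:
        i = stack[-1]
        fresh = [j for j in adj[i] if j not in visited]
        if fresh:
            nxt = min(fresh, key=lambda j: angle(i, j))
            visited.add(nxt)
            stack.append(nxt)
        else:
            stack.pop()  # dead end: step back
            if not stack:
                raise ValueError("the graph must stay connected")
        steps += 1
    return steps

def essentiality(pos, edges, edge, pairs):
    """d_g of the graph without `edge` minus d_g of the full graph,
    with d_g averaged over the given (s, t) pairs only."""
    def d_g(es):
        return sum(gsn_steps(pos, es, s, t) for s, t in pairs) / len(pairs)
    reduced = [e for e in edges if set(e) != set(edge)]
    return d_g(reduced) - d_g(edges)

# Assumed toy layout: edge (0, 1) points almost straight at the target 3
# but only leads there through a detour via vertex 2.
pos = {0: (0, 0), 1: (1, 0.1), 2: (1, -1), 3: (2, 0)}
edges = [(0, 1), (1, 2), (0, 2), (2, 3)]
pairs = [(0, 3), (3, 0)]
print(essentiality(pos, edges, (0, 1), pairs))  # -0.5 (a "Braess" edge here)
print(essentiality(pos, edges, (0, 2), pairs))  # 0.5 (a "normal" edge here)
```

In this layout the tempting edge (0, 1) lures the navigator into a three-move route from 0 to 3, so removing it shortens the average greedy distance over the chosen pairs, whereas removing the edge (0, 2) lengthens it.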

In our example road structures, we observe that the detailed structure of networks really matters.
Figures 3(d) and (i) clearly demonstrate the diversity of road structures with the four representative
roads for each city. The examples clarify that the relatively low correlations between $e$ and $b$ stem
from those roads whose importance is quite different for GSN and SPN. Roads in the periphery show, as
expected, low $e$ values, but there are some important exceptions in terms of GSN, e.g., road 1 in
Fig. 3(e) with its large $e$ value. In the case of the Braess road 2 in Boston [Fig. 3(e)], we observe
that its absence helps the large volume of traffic from the upper left part avoid entering the central
part on its way to the lower right part, and induces it to take more efficient peripheral roads. Of
course, external geographic factors such as rivers, tunnels, bridges, and roads with various speed
limits are also important in practice. We take the simplest approach and assume that the geographical
context primarily gives a sense of direction for the navigation, and neglect other effects. For future
work it would be interesting to extend our approach by incorporating other information through other
navigability functions, e.g., the Bureau of Public Roads (BPR) function [16]. We also notice that road 3
in Fig. 3(e), with the largest $e$ value (and the second largest $b$ value), corresponds to the Harvard
Bridge across the Charles River, illustrating that crucial infrastructure can be deduced based solely on
the geometric positions, without explicit awareness of the river.

The multiple linear regression results shown in Table II demonstrate that the $e$ values cannot be
reliably predicted from a linear combination of those network and geometric measures, as indicated by
the low $R^2$ values. From the same regression analysis on the much larger Swiss and European railways,
we observe even smaller $R^2$ values, estimated from $10^4$ sampled source-target pairs for each edge
removal. Therefore, $e$, or the "Braessiness," can be measured only by considering this greedy behavior
of navigators. Finally, we investigate whether there is any correlation between navigability
TABLE II: Coefficients for the multiple linear regression $e = m_1 b + m_2 (\mathrm{length}) + m_3 c + m_4 (k_i k_j)$
for road networks, with some measures defined on edges: $b$, the edge length, the distance $c$ from the
midpoint of edges to the centroid of vertices, and the product $k_i k_j$ of degrees of vertices attached
to edges. The statistical significance codes are (a) < 0.05, (b) < 0.01, and (c) < 0.001.

Road             Boston                           NYC
$m_1$            6.902 (c)                        9.389 (c)
$m_2$            $-4.687 \times 10^{-5}$ (a)      $-6.141 \times 10^{-5}$ (b)
$m_3$            $-1.504 \times 10^{-6}$          $2.142 \times 10^{-5}$ (a)
$m_4$            $-8.817 \times 10^{-3}$ (b)      $-5.653 \times 10^{-3}$ (a)
Multiple $R^2$   0.2508                           0.1917
p value          $7.784 \times 10^{-9}$           $3.395 \times 10^{-9}$

FIG. 4: Scatter plots for $\nu$ (a) and $\zeta$ (b) vs the number of vertices $N$, for the 100 large
cities in the world.



and various socioeconomic indices. We selected the 20 largest cities in each of the United States
(U.S.), Europe, Asia, Latin America, and Africa (100 cities in total), and used the MERKAARTOR program
[20] to extract a representative sample of each city (a square with 2 km sides). First, we compared
$\nu$ and $\zeta$ to the numbers of vertices $N$, as shown in Fig. 4. There is a striking difference
between the two cases: there is a clear scaling relationship between $\zeta$ and $N$ [Fig. 4(b)],
meaning that the random navigation is statistically determined by the system size. In contrast, the
widely scattered points in Fig. 4(a) strongly suggest that the number of vertices cannot predict $\nu$
at all, in addition to the fact that purely topological measures cannot predict $e$ in Table II. In this
respect, $\nu$ reflects unique properties of different cities with vastly different developmental
histories. We could not find any socioeconomic measures (or linear combinations of them), e.g.,
population density, median resident income, or fraction of public transit commuters, showing
statistically significant correlations with the navigability. Again this leads to the conclusion that
different cities have unique properties of navigability independent of other socioeconomic factors. One
example is the correlation between the navigability and the population change ratio of the 20 cities in
the U.S., defined as the ratio of the population change between 1960 and 2010 to the population in 1960
[21]. We observe a very weak negative correlation between $\nu$ and the ratio [$R^2 = 0.09(3)$], perhaps
too weak to claim a meaningful dependence.



In summary, we have introduced a new routing strategy incorporating greedy movement and memory of
navigators. This strategy, we believe, is a minimal model capturing the basic ingredients of human
navigation, namely, incomplete navigational information and the memory needed not to get lost. Using
results from real-world road and railway structures, we demonstrate the important difference in terms of
centralities for navigation and the fact that the celebrated Braess's paradox arises from the behavior
of navigators equipped with just this simple strategy. From the observation of correlation profiles for
centralities in road structures, we have shown that the importance of each element heavily depends on
the detailed layout of structures. We have focused on the final efficiency of the routing processes in this
work, but the detailed process of GSN, e.g., the relative dis- 
tance toward the target during the routing process or the preva- 
lence of backtracking related to the structural properties of 
roads, can be worthwhile future work. This type of tool — 
linking spatial cognition, the environment, and emergent nav- 
igational properties — can be helpful for urban planners and 
architects 